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NOVEL RECEPTOR 

Field of the Invention :- 

The present invention relates to isolated polynucleotide molecules 
5 which encode a novel seven-transmembrane G-protein-coupled receptor 
designated TSR32. In addition, the present invention relates to the use of 
these molecules in the production of TSR32 receptors using recombinant 
technology and to methods of screening compounds for agonists and/or 
antagonists of the TSR32 receptor. 

10 

Background of the Invention :- 

Proteins with seven-transmembrane (7TM) segments define a 
superfamily of receptors with a common structure comprising an 
extracellular N- terminus, three extramembranous loops on either side of the 

15 plasma membrane, and a cytoplasmic C-terminus. The majority of such 

proteins function as cell-surface receptors for a variety of ligands, including 
small molecules, peptides, hormones, ions, and external sensory stimuli such 
as odorants (Watson, S. and Arkinstall, S. t The G-protein Linked Receptor 
Facts Book, Academic Press, London, 1994). One family within the 7TM 

20 superfamily is known as the secretin receptor family and comprises member 
receptors with specificity for secretin, calcitonin, glucagon, glucagon-like 
peptide 1, parathyroid hormone, parathyroid-related peptide, corticotropin- 
releasing factor (CRF), growth hormone-releasing hormone (GHRH), gastric 
inhibitory polypeptide, pituitary adenylate-cyclase-activating polypeptide 

25 (PACAP), vasoactive intestinal peptides (VIP) and insect diuretic hormone 
(DHR). 

The present inventor has now identified a further member receptor of 
the secretin receptor family. This receptor, designated TSR32, appears to be 
expressed in a large variety of tissue types including heart, kidney, lung and 
30 brain with highest levels in the thyroid gland. Since agonists and/or 

antagonists for this receptor may have commercial value, for example as 
regulators of important thyroid functions in growth, development and 
metabolic activity, the ability to produce TSR32 receptors by recombinant 
DNA technology would be advantageous. 

35 
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Disclosure of the Invention :- 

Thus, in a first aspect, the present invention provides an isolated 
polynucleotide molecule encoding a G-protein-coupled receptor, designated 
TSR32, which is characterised by the N-terminal amino acid sequence- 

MTPQSLLQTT (SEQ ID NO: 1), 
or a functionally equivalent fragment of said receptor. 

Preferably, the polynucleotide molecule encodes a human TSR32 
receptor of about 693 amino acids. 

More preferably, the isolated polynucleotide molecule encodes a 
human TSR32 receptor comprising an amino acid sequence substantially 
corresponding to that shown as SEQ ID NO: 2 or a functionally equivalent 
fragment thereof. 

The nucleotide sequence of a polynucleotide molecule in accordance 
with the first aspect, may comprise a nucleotide sequence substantially 
corresponding to or, showing at least 75% (preferably, at least 90% or, even 
more preferably, at least 95%) sequence identity to that shown as SEQ ID 
NO: 3 or SEQ ID NO: 4 or any portion thereof encoding a functionally 
equivalent TSR32 receptor fragment. 

The polynucleotide molecule maybe incorporated into plasmids or 
expression vectors (including viral vectors), which may then be introduced 
into suitable host cells (e.g. bacterial, yeast, insect and mammalian host 
cells). Such host cells may be used to express the TSR32 receptor encoded 
by the isolated polynucleotide molecule. 

Accordingly, in a second aspect, the present invention provides a host 
cell transformed with the polynucleotide molecule of the first aspect. 

In a third aspect, the present invention provides a method of producing 
TSR32 receptors or functionally equivalent fragments thereof, comprising 
culturing the host cell of the second aspect under conditions enabling the 
expression of the polynucleotide molecule and, optionally, recovering the 
30 TSR32 receptor or fragments thereof. 

Preferably, the host cell is mammalian or of insect origin. Where the 
cell is mammalian, it is presently preferred that it be a Chinese hamster 
ovary (CHO) cell or a human embryonic kidney (HEK) 293 cell. Where the 
cell is of insect origin, it is presently preferred that it be an insect Sf9 cell. 

In a preferred embodiment, the TSR32 receptor or fragments thereof 
are expressed onto the surface of the host cells. 
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The polynucleotide molecules of the present invention represent a new 
subfamily of the secretin receptor family which may be of interest both 
clinically and commercially as it is expressed in many tissvie types and may 
therefore be involved in a wide variety of systems. 
5 By using the polynucleotide molecules of the present invention it is 

possible to obtain TSR32 receptor protein or functionally equivalent 
fragments thereof in a substantially pure form. 

Accordingly, in a fourth aspect, the present invention provides a G- 
protein-coupled receptor, designated TSR32, which is characterised by the N- 
10 terminal amino acid sequence: 

MTPQSLLQTT (SEQ ID NO: 1), 
or a functionally equivalent fragment thereof, in a substantially pure form. 

Preferably, the purified TSR32 receptor has an amino acid sequence 
substantially corresponding to that shown as SEQ ID NO: 2. 
15 In a fifth aspect the present invention provides an antibody or 

antibody fragment capable of specifically binding to a TSR32 receptor 
according to the fourth aspect. 

The antibody may be monoclonal or polyclonal, however, it is 
presently preferred that the antibody is a monoclonal antibody. Suitable 
20 antibody fragments include Fab, (F(ab') 2 and scFv. 

In a sixth aspect, the present invention provides a non-human animal 
transformed with a polynucleotide molecule according to the first aspect of 
the present invention. 

In a seventh aspect, the present invention provides a method for 
25 detecting agonist and/or antagonist compounds of the TSR32 receptor, 

comprising contacting a TSR32 receptor, functionally equivalent fragment 
thereof or a host cell transformed with and expressing the polynucleotide 
molecule of the first aspect, with a test compound under conditions enabling 
the activation of the TSR32 receptor or functionally equivalent fragment 
30 thereof, and detecting an increase or decrease in the activity of the TSR32 
receptor or functionally equivalent fragment thereof. 

An increase or decrease in activity of the receptor or functionally 
equivalent fragment thereof may be detected by measuring changes in CAMP 
production, CA 2+ levels or IP3 turnover after activating the receptor or 
35 fragment thereof with specific agonists or antagonists. 
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In a further aspect, the present invention provides an oligonucleotide 
or polynucleotide probe comprising a nucleotide sequence of 10 or more 
nucleotides, the probe comprising a nucleotide sequence such that the probe 
specifically hybridises to the polynucleotide molecule of the first aspect 
under high stringency conditions (Sambrook et al., Molecular Cloning: a 
laboratory manual. Second Edition, Cold Spring Harbor Laboratory Press). 
Preferably the probe is labelled. 

In a still further aspect, the present invention provides an antisense 
polynucleotide molecule comprising a nucleotide sequence capable of 
specifically hybridising to an mRNA molecule which encodes a TSR32 
receptor so as to prevent translation of the mRNA molecule. 

Such antisense polynucleotide molecules may include a ribozyme 
region to catalytically inactivate mRNA to which it is hybridized. 

The polynucleotide molecule of the first aspect of the invention may 
be a dominant negative mutant which encodes a gene product causing an 
altered phenotype by, for example, reducing or eliminating the activity of 
endogenous TSR32 receptors. 

The term "substantially corresponding" as used herein in relation to 
amino acid sequences is intended to encompass minor variations in the 
amino acid sequences which do not result in a decrease in the biological 
activity of the TSR32 receptor. These variations may include conservative 
amino acids substitutions. The substitutions envisaged are:- 

G, A, V, I, L, M; D, E; N, Q; S, T; K, R, H; F. Y, W, H; and 
P, Na-alkalamino acids. 

The term "substantially corresponding" as used herein in relation to 
nucleotide sequences is intended to encompass minor variations in the 
nucleotide sequence which due to degeneracy in the DNA code do not result 
in a change in the encoded protein. Further, this term is intended to 
encompass other minor variations in the sequence which may be required to 
enhance expression in a particular system but in which the variations do not 
result in a decrease and biological activity of the encoded protein. 

The term "functionally equivalent fragment/s" as used herein is 
intended to refer to fragments of the TSR32 receptor that exhibit binding 
specificity and activity that is substantially equivalent to the full length 
35 receptor from which it/they is/are derived. 
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The terms "comprise", "comprises" and "comprising" as used 
throughout the specification are intended to refer to the inclusion of a stated 
step, component or feature or group of steps, components or features with or 
without the inclusion of a further step, component or feature or group of 
5 steps, components or features. 

Brief description of the figures :- 

Figure 1 provides the nucleotide sequence of a cDNA encoding the human 
10 TSR32 receptor and includes the predicted amino acid sequence. 

Figure 2 provides a sequence alignment of the amino acid sequences of the 
human TSR32 receptor and the human epididymis-specific HE6 receptor 
(Osterhoff, C, DNA and Cell Biology, Vol. 16 No. 4, 1997). 



15 



Example :- 

Degenerative oligonucleotides corresponding to conserved regions 
within the human Glucagon-like peptide receptor (GLPl) family were used to 
amplify specific DNA sequences from a variety of cDNA library DNA's. 
20 Library DNA from human hypothalamus and heart yielded fragments of the 
expected size. These fragments were subcloned and sequenced. One 
fragment, 456bp long, revealed a novel 7TM receptor sequence distantly 
related to the secretin receptor family. Screening a human heart cDNA 
library (Stratagene) using this fragment under high stringency conditions 
25 identified two positive hybridising clones, 1 and 2.9kb in size, respectively. 
The larger clone contains an open reading frame of 2079 nucleotides 
encoding a 693 amino acid long protein (designated TSR32). The nucleotide 
and putative amino acid sequence is provided at Figure 1. 

A search of the EMBL/GenBank database revealed low but significant 
30 sequence similarities to the secretin receptor family of TM7 receptors. The 
sequence similarity to the above receptor sequences is only about 25% 
identity and also limited to the transmembrane region. However, the highest 
overall sequence similarity was found to four other G-protein coupled 
receptor subtypes, the human epididymis-specific HE6 (31% identity), the 
human lymphocyte antigen CD97 (33% identity), the orphan human receptor 
EMR1 (27% identity), the insect DHR (26% identity) and the rat latrophilin- 



35 
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related protein 1 precursor (35% identity) all of which share a large 
extracellular domain with numerous putative O- and N-glycosylation sites 
Except for DHR, in all of the receptors there is a cysteine motif preceding the 
first transmembrane domain. Further, in contrast to the CD97 and EMRl 
receptors, TSR32 does not contain any EGF-domains or calcium binding sites 
in the extracellular domain. 

Interestingly, at the gene level the exon/intron organisation of the 
TSR32-encoding cDNA. at least within the region encoding the 
transmembrane domain, is most similar to the gene structure of the PACAP 
receptor family, with fewer but identical exon/intron borders. 

Northern analysis indicates a low level of expression of the receptor 
niRNA in a large variety of tissues including kidney, lung, cerebellum and 
heart. The highest level of mRNA however, was found in the thyroid gland. 
This expression pattern of the receptor mRNA indicates that TSR32 has 
important functions in metabolic regulations throughout the body via the 
thyroid gland. 

The gene for TSR32 has been mapped to human chromosome 16q31 by 
in situ hybridisation and this localisation was confirmed by radiation hybrid 
analysis. An autosomal recessive disorder (Bardet Biedl Syndrome; Kwitek- 
Black, A.E. et al., Nature Genetics 5: 392-396, 1993) has also been linked to 
this region. Features of this syndrome include obesity, retinal degeneration 
hypogonadism and mental retardation. The expression of TSR32 examined 
by in situ hybridisation and Northern analysis overlaps strongly with the 
tissues affected in the Bardet Biedl Syndrome making TSR32 a good 
candidate for this locus. 

It will be appreciated by persons skilled in the art that numerous 
variations and/or modifications may be made to the invention as shown in 
the specific embodiments without departing from the spirit or scope of the 
invention as broadly described. The present embodiments are, therefore, to 
be considered in all respects as illustrative and not restrictive. 
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Sequence listings:- 



Applicant: Garvan Institute of Medical Research 
Title of Invention: Novel receptor 
Number of SEQ ID Nos : 4 



SEQ ID NO: 1 
Length: 10 
Type PRT 

Organism: Homo sapiens 
Sequence: 1 

Met Thr Pro Gin Ser Leu Leu Gin Thr Thr 
15 10 



SEQ ID NO: 2 
Length 693 
Type PRT 

Organism: Homo sapiens 
Sequence: 2 

Met Thr Pro Gin Ser Leu Leu Gin Thr Thr Leu Phe Leu Leu Ser Leu 
15 10 15 

Leu Phe Leu Val Gin Gly Ala His Gly Arg Gly His Arg Glu Asp Phe 
20 25 30 

Arg Phe Cys Ser Gin Arg Asn Gin Thr His Arg Ser Ser Leu His Tyr 
35 40 45 

Lys Pro Thr Pro Asp Leu Arg lie Ser lie Glu Asn Ser Glu Glu Ala 
50 55 60 

Leu Thr Val His Ala Pro Phe Pro Ala Ala His Pro Ala Ser Arg Ser 
65 70 75 80 

Phe Pro Asp Pro Arg Gly Leu Tyr His Phe Cys Leu Tyr Trp Asn Arg 

85 90 95 

His Ala Gly Arg Leu His Leu Leu Tyr Gly Lys Arg Asp Phe Leu Leu 
100 105 110 

Ser Asp Lys Ala Ser Ser Leu Leu Cys Phe Gin His Gin Glu Glu Ser 
115 120 125 

Leu Ala Gin Gly Pro Pro Leu Leu Ala Thr Ser Val Thr Ser Trp Trp 
130 135 140 



Ser Pro Gin Asn lie Ser Leu Pro Ser Ala Ala Ser Phe Thr Phe S 



er 
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145 150 



155 160 



Phe His Ser Pro Pro His Thr Ala Ala His Asn Ala Ser Val Asp Met 

165 170 175 

Cys Glu Leu Lys Arg Asp Leu Gin Leu Leu Ser Gin Phe Leu Lys His 
180 185 190 

Pro Gin Lys Ala Ser Arg Arg Pro Ser Ala Ala Pro Ala Ser Gin Gin 
195 200 205 

Leu Gin Ser Leu Glu Ser Lys Leu Thr Ser Val Arg Phe Met Gly Asp 
210 215 220 

Met Val Ser Phe Glu Glu Asp Arg He Asn Ala Thr Val Trp Lys Leu 
225 230 235 240 

Gin Pro Thr Ala Gly Leu Gin Asp Leu His He His Ser Arg Gin Glu 

245 250 255 

Glu Glu Gin Ser Glu He Met Glu Tyr Ser Val Leu Leu Pro Arg Thr 
260 265 270 

Leu Phe Gin Arg Thr Lys Gly Arg Ser Gly Glu Ala Glu Lys Arg Leu 
275 280 285 

Leu Leu Val Asp Phe Ser Ser Gin Ala Leu Phe Gin Asp Lys Asn Ser 
290 2 95 300 

Ser Gin Val Leu Gly Glu Lys Val Leu Gly He Val Val Gin Asn Thr 
305 310 315 320 

Lys Val Ala Asn Leu Thr Glu Pro Val Val Leu Thr Phe Gin His Gin 

325 330 335 

Leu Gin Pro Lys Asn Val Thr Leu Gin Cys Val Phe Trp Val Glu Asp 
340 345 350 

Pro Thr Leu Ser Ser Pro Gly His Trp Ser Ser Ala Gly Cys Glu Thr 
355 360 365 

Val Arg Arg Glu Thr Gin Thr Ser Cys Phe Cys Asn His Leu Thr Tyr 
370 375 380 

Phe Ala Val Leu Met Val Ser Ser Val Glu Val Asp Ala Val His Lys 
385 390 395 400 

His Tyr Leu Ser Leu Leu Ser Tyr Val Gly Cys Val Val Ser Ala Leu 

405 410 415 

Ala Cys Leu Val Thr He Ala Ala Tyr Leu Cys Ser Arg Val Pro Leu 
420 425 430 

Pro Cys Arg Arg Lys Pro Arg Asp Tyr Thr He Lys Val His Met Asn 
435 440 445 
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Leu Leu Leu Ala Val Phe Leu Leu Asp Thr Ser Phe Leu Leu Ser Glu 
450 455 460 

Pro Val Ala Leu Thr Gly Ser Glu Ala Gly Cys Arg Ala Ser Ala lie 
465 470 475 480 

Phe Leu His Phe Ser Leu Leu Thr Cys Leu Ser Trp Met Gly Leu Glu 

485 490 495 

Gly Tyr Asn Leu Tyr Arg Leu Val Val Glu Val Phe Gly Thr Tyr Val 
500 505 510 

Pro Gly Tyr Leu Leu Lys Leu Ser Ala Met Gly Trp Gly Phe Pro lie 
515 520 525 

Phe Leu Val Thr Leu Val Ala Leu Val Asp Val Asp Asn Tyr Gly Pro 
530 535 540 

lie lie Leu Ala Val His Arg Thr Pro Glu Gly Val lie Tyr Pro Ser 
545 550 555 560 

Met Cys Trp lie Arg Asp Ser Leu Val Ser Tyr lie Thr Asn Leu Gly 

565 570 575 

Leu Phe Ser Leu Val Phe Leu Phe Asn Met Ala Met Leu Ala Thr Met 
580 585 590 

Val Val Gin lie Leu Arg Leu Arg Pro His Thr Gin Lys Trp Ser His 
595 600 605 

Val Leu Thr Leu Leu Gly Leu Ser Leu Val Leu Gly Leu Pro Trp Ala 
610 615 620 

Leu lie Phe Phe Ser Phe Ala Ser Gly Thr Phe Gin Leu Val Val Leu 
625 630 635 640 

Tyr Leu Phe Ser He He Thr Ser Phe Gin Gly Phe Leu He Phe He 

645 650 655 

Trp Tyr Trp Ser Met Arg Leu Gin Ala Arg Gly Gly Pro Ser Pro Leu 
660 665 670 

Lys Ser Asn Ser Asp Cys Ala Arg Leu Pro He Ser Ser Gly Ser Thr 
675 680 685 

Ser Ser Ser Arg He 
690 
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SEQ ID NO: 3 
Length 2082 
Type DNA 

Organism: Homo sapiens 
Sequence : 3 

atgactcccc agtcgctgct gcagacgaca 
caaggtgccc acggcagggg ccacagggaa 
acacacagga gcagcctcca ctacaaaccc 
tccgaagagg ccctcacagt ccatgcccct 
ttccctgacc ccaggggcct ctaccacttc 
ttacatcttc tctatggcaa gcgtgacttc 
tgcttccagc accaggagga gagcctggct 
acctcctggt ggagccctca gaacatcagc 
ttccacagtc ctccccacac ggccgctcac 
agggacctcc agctgctcag ccagttcctg 
tcggctgccc ccgccagcca gcagttgcag 
ttcatggggg acatggtgtc cttcgaggag 
cagcccacag ccggcctcca ggacctgcac 
gagatcatgg agtactcggt gctgctgcct 
agcggggagg ctgagaagag actcctcctg 
gacaagaatt ccagccaagt cctgggtgag 
aaagtagcca acctcacgga gcccgtggtg 
aatgtgactc tgcaatgtgt gttctgggtt 
tggagcagtg ctgggtgtga gaccgtcagg 
cacttgacct actttgcagt gctgatggtc 
cactacctga gcctcctctc ctacgtgggc 
accattgccg cctacctctg ctccagggtg 
tacaccatca aggtgcacat gaacctgctg 
ctgctcagcg agccggtggc cc tgacaggc 
ttcctgcact tctccctgct cacctgcctt 
taccgactcg tggtggaggt ctttggcacc 
gccatgggct ggggcttccc catctttctg 
aactatggcc ccatcatctt ggctgtgcat 
atgtgctgga tccgggactc cctggtcagc 
gtgtttctgt tcaacatggc catgctagcc 
ccccacaccc aaaagtggtc acatgtgctg 
ctgccctggg ccttgatctt cttctccttt 
taccttttca gcatcatcac ctccttccaa 
atgcggctgc aggcccgggg tggcccctcc 
ctccccatca gctcgggcag cacctcgt cc 



ctgttcctgc tgagtctgct cttcctggtc 60 
gactttcgct tctgcagcca gcggaaccag 120 
acaccagacc tgcgcatctc catcgagaac 180 
ttccctgcag cccaccctgc ttcccgatcc 240 
tgcctctact ggaaccgaca tgctgggaga 300 
ttgctgagtg acaaagcctc tagcctcctc 360 
cagggccccc cgctgttagc cacttctgtc 420 
ctgcccagtg ccgccagctt caccttctcc 480 
aatgcctcgg tggacatgtg cgagctcaaa 540 
aagcatcccc agaaggcctc aaggaggccc 600 
agcctggagt cgaaactgac ctctgtgaga 660 
gaccggatca acgccacggt atggaagctc 720 
atccactccc ggcaggagga ggagcagagc 780 
cgaacactct tccagaggac gaaaggccgg 840 
gtggacttca gcagccaagc cctgttccag 900 
aaggtcttgg ggattgtggt acagaacacc 960 
ctcactttcc agcaccagct acagccgaag 1020 
gaagacccca cattgagcag cccggggcat 1080 
agagaaaccc aaacatcctg cttctgcaac 1140 
tcctcggtgg aggtggacgc cgtgcacaag 1200 
tgtgtcgtct ctgccctggc ctgccttgtc 1260 
cccctgccgt gcaggaggaa acctcgggac 1320 
ctggccgtct tcctgctgga cacgagcttc 1380 
tctgaggctg gctgccgagc cagtgccatc 1440 
tcctggatgg gcctcgaggg gtacaacctc 1500 
tatgtccctg gctacctact caagctgagc 1560 
gtgacgctgg tggccctggt ggatgtggac 1620 
aggactccag agggcgtcat ctacccttcc 1680 
tacat caeca acctgggcct cttcagcctg 17 4 0 
accatggtgg tgeagatect gcggctgcgc 180.0 
acactgetgg gcctcagcct ggtccttggc 1860 
gcttctggca ccttccagct tgtcgtcctc 1920 
ggcttcctca tcttcatctg gtactggtcc 1980 
cctctgaaga gcaactcaga ctgcgccagg 2 04 0 
agccgcatct ag 2082 



SEQ ID NO: 4 
Length 2834 
Type DNA 

Organism: Homo sapiens 
Sequence: 4 

gaattcegge agcagggtct cgctctgtca 
getcategta acctccacct cccgggttca 
gctgggatta caggtggtga cttccaagag 
tcgctgctgc agacgacact gttcctgctg 
ggcaggggee acagggaaga ctttcgcttc 



cacaggctgg agtgcagtgg tgtgatcttg 60 

agtgattctc atgcctcagc ctcccgagta 12 0 

tgactccgtc ggaggaaaat gactccccag 180 

agtctgetet tcctggtcca aggtgcccac 240 

tgcagccagc ggaaccagac acacaggagc 300 
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agcct ccact 
ctcacagtcc 
aggggcctct 
tatggcaagc 
caggaggaga 
agccctcaga 
ccccacacgg 
ctgct cagcc 
gccagccagc 
atggtgtcct 
ggcctccagg 
tactcggtgc 
gagaagagac 
agccaagtcc 
ctcacggagc 
caatgtgtgt 
ggg tgtgaga 
tttgcagtgc 
ctcctctcct 
tacctctgct 
gtgcacatga 
ccggtggccc 
tccctgctca 
gtggaggtct 
ggcttcccca 
atcatcttgg 
cgggactccc 
aacatggcca 
aagtggtcac 
ttgatcttct 
atcatcacct 
gcccggggtg 
tcgggcagca 
cagagatgcg 
agtcagccgc 
gacggactcc 
gctcccacgg 
cccaacccag 
gggccaggcc 
cagaaatgtg 
ctgtcatttt 
ccagagccct 
tgagcccgga 



acaaacccac 
atgcccctt t 
accacttctg 
gtgacttctt 
gcctggctca 
acat cagcct 
ccgct cacaa 
agttcctgaa 
agttgcagag 
tcgaggagga 
acctgcaca t 
tgctgcctcg 
tcctcctggt 
tgggtgagaa 
ccgtggtgct 
tctgggttga 
ccg tcaggag 
tgatggtctc 
acgtgggctg 
ccagggtgcc 
acctgctgct 
tgacaggctc 
cctgcctttc 
ttggcaccta 
tctttctggt 
ctgtgcatag 
tggtcagcta 
tgctagccac 
atgtgctgac 
tctcctttgc 
cct tccaagg 
gcccct cccc 
cct cgtccag 
gcctcgtcgc 
agactttgga 
cggggctggg 
gactcagaag 
ctggaggcct 
ttggatcttg 
gctccagttg 
aacctcaggt 
ggcggaggag 
attc 



accagacctg 
ccctgcagcc 
cc tctactgg 
gctgagtgac 
gggccccccg 
gcccagtgcc 
tgcctcggtg 
gcatccccag 
cctggagtcg 
ccggatcaac 
ccactcccgg 
aacactcttc 
ggacttcagc 
ggtcttgggg 
cactttccag 
agaccccaca 
agaaacccaa 
ctcggtggag 
tgtcgtctct 
cctgccgtgc 
ggccgtcttc 
tgaggctggc 
c tggatgggc 
tgtccctggc 
gacgctggtg 
gactccagag 
catcaccaac 
catggtggtg 
actgctgggc 
t tctggcacc 
cttcctcatc 
tctgaagagc 
ccgcatctag 
acactgcctg 
aagcccaacg 
gcttttgaat 
tgcgccgcca 
ggtctctcct 
agggtctggc 
ctctgtctct 
ggcacccagg 
aggccctttg 



cgcatct cca 
caccctgctt 
aaccgacatg 
aaagcct eta 
ctgttagcca 
gccagcttca 
gacatgtgcg 
aaggcctcaa 
aaactgacct 
gecaeggtat 
caggaggagg 
cagaggacga 
agccaagccc 
attgtggtac 
caccagctac 
ttgagcagee 
acatcctgct 
gtggacgccg 
gccctggcct 
aggaggaaac 
ctgctggaca 
tgccgagcca 
ctcgaggggt 
tacctactca 
gccctggtgg 
ggegtcatet 
ctgggcctct 
cagatcctgc 
ctcagcctgg 
ttccagcttg 
ttcatctggt 
aactcagact 
gcctccagcc 
tggcccccga 
accatggaga 
tggccttggg 
tgetgectag 
tacaacccct 
acatccttaa 
cgtggtcacc 
gcgaatgggg 
ccaggagcac 



tcgagaact c 
cccgatcc tt 
ctgggaga t t 
gcctcct ctg 
ct tctgt cac 
ccttctcct t 
agctcaaaag 
ggaggcect c 
ctgtgaga tt 
ggaagctcca 
ageagagega 
aaggceggag 
tgttccagga 
agaacaccaa 
agecgaagaa 
eggggcattg 
tctgcaacca 
tgeacaagea 
gect tgt cac 
ctegggacta 
cgagcttcct 
gtgecatett 
acaacctcta 
agetgagege 
atgtggacaa 
acccttccat 
tcagcctggt 
ggctgcgccc 
tcct tggect 
tcgtcctcta 
actggtccat 
gcgccaggct 
cacctgccca 
gccaggccca 
ga tgggccgt 
gaetactegg 
ggtactgt cc 
gggcccagcc 
tcctgtgccc 
ctgagggcac 
cccagggcag 
agcagcagct 



egaagaggee 360 

ccctgacccc 420 

acatcttctc 480 

cttccagcac 540 

ctcctggtgg 600 

ccacagtcct 660 

ggacctccag 720 

ggctgccccc 780 

catgggggac 840 

gcccacagcc 900 

gatcatggag 960 

eggggagget 1020 

caagaattcc 1080 

agtagecaac 1140 

tgtgactctg 1200 

gagcagtget 1260 

cttgacctac 1320 

ctacctgagc 1380 

cattgccgcc 1440 

caccatcaag 1500 

getcagegag 1560 

cctgcacttc 1620 

ccgactcgtg 1680 

catgggctgg 1740 

ctatggcccc 1800 

gtgctggatc 1860 

gtttctgttc 1920 

ccacacccaa 1980 

gccctgggcc 2040 

ccttttcagc 2100 

gcggctgcag 2160 

ccccatcagc 2220 

tgtgatgaag 2280 

gccccaggcc 2340 

tgccatggtg 2400 

ctctcactca 246.0 

ccacatctgt 2520 

teattgetgg 2580 

ctgcctggga 2640 

tctgcatcct 2700 

accttcaggg 2760 

cgcctacctc 2820 
2834 
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Claims: 

1. An isolated polynucleotide molecule encoding a G-protein-coupled 
receptor which is characterised by the N-terminal amino acid sequence- 

5 MTPQSLLQTT (SEQ ID NO: 1), 

or a functionally equivalent fragment of said receptor. 

2. A polynucleotide molecule according to claim 1, wherein said 
polynucleotide molecule encodes a G-protein-coupled receptor of human 

10 origin of about 693 amino acids in length. 

3. A polynucleotide molecule according to claim 2, wherein the 
polynucleotide molecule encodes a G-protein-coupled receptor comprising an 
amino acid sequence substantially corresponding to that shown as SEQ ID 



20 



25 



30 



35 



4. An isolated polynucleotide molecule encoding a G-protein-coupled 
receptor, wherein the polynucleotide molecule comprises a nucleotide 
sequence showing at least 75% sequence identity to that shown as SEQ ID 
NO: 3 or SEQ ID NO: 4 or any portion thereof encoding a functionally 
equivalent receptor fragment. 

5. A polynucleotide molecule according to claim 4, wherein the 
polynucleotide molecule comprises a nucleotide sequence showing at least 
90% sequence identity to that shown as SEQ ID NO: 3 or SEQ ID NO- 4 or 
any portion thereof encoding a functionally equivalent receptor fragment. 

6. A polynucleotide molecule according to claim 5 , wherein the 
polynucleotide molecule comprises a nucleotide sequence showing at least 
95% sequence identity to that shown as SEQ ID NO: 3 or SEQ ID NO: 4 or 
any portion thereof encoding a functionally equivalent receptor fragment. 

7. A plasmid or expression vector including a polynucleotide molecule 
according to any one of the preceding claims. 
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8. A host cell transformed with a polynucleotide molecule according to 
any one of claims 1 to 6. 

9. A host cell according to claim 8, wherein the cell is a mammalian or 
5 insect cell. 

10. A host cell according to claim 9, wherein the cell is a Chinese hamster 
ovary (CHO) cell, human embryonic kidney (HEK) 293 cell or an insect Sf9 
cell. 

10 

11. A host cell according to any one of claims 8 to 10 wherein the cell 
expresses the G-protein-coupled receptor or functionally equivalent fragment 
thereof onto the cell's surface. 

15 12- A G-protein-coupled receptor which is characterised by the N-terminal 
amino acid sequence: 

MTPQSLLQTT (SEQ ID NO: 1), 
or a functionally equivalent fragment of said receptor, in a substantially pure 
form. 

20 

13. A receptor according to claim 12 ; wherein said receptor is a human 
receptor of about 693 amino acids. 

14. A receptor according to claim 13, wherein the receptor has an amino 
25 acid sequence substantially corresponding to that shown as SEQ ID NO: 2. 

15. An antibody or fragment thereof which specifically binds to a G- 
protein-coupled receptor according to any one of claims 12 to 14. 

30 16. A non-human animal transformed with a polynucleotide molecule 
according to any one of claims 1 to 6. 

17. A method for detecting agonist or antagonist agents of a G-protein- 
coupled receptor comprising, contacting a G-protein-coupled receptor 
35 according to any one of claims 12 to 14 or a host cell transformed with and 
expressing a DNA molecule according to any one of claims 1 to 6, with a test 
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agent under conditions enabling the activation of the receptor, and detecting 
an increase or decrease in the receptor activity. 

18. An oligonucleotide or polynucleotide probe comprising a nucleotide 
sequence of 10 or more nucleotides, the probe comprising a nucleotide 
sequence such that the probe specifically hybridises to the polynucleotide 
molecule of any one of claims 1 to 6 under high stringency conditions. 

19. An antisense polynucleotide molecule comprising a nucleotide 
sequence capable of specifically hybridising to an mRNA molecule which 
encodes a G-protein-coupled receptor encoded by the polynucleotide 
molecule of any one of claims 1 to 6, so as to prevent translation of the 
mRNA molecule. 



20. A method of producing G-protein-coupled receptors or functionally 
equivalent fragments thereof according to any one of claims 12 to 14. 
comprising culturing a host cell according to any one of claims 8 to 11 under 
conditions enabling the expression of the polynucleotide molecule and 
optionally recovering the receptors or functionally equivalent fragments 
thereof. 
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FIGURE 1 1/5 

70 

* ****** 

GAATTCCGGCAGCAGGGTCTCGCTCTGTCACACAGGCTGGAGTGCAGTGGTGTGATCTTGGCTCATCGTA 

140 

* ****** 

ACCTCCACCTCCCGGGTTCAAGTGATTCTCATGCCTCAGCCTCCCGAGTAGCTGGGATTACAGGTGGTGA 

210 

******* 

CTTCCAAGAGTGACTCCGTCGGAGGAAAATGACTCCCCAGTCGCTGCTGCAGACGACACTGTTCCTGCTG 

MTPQSLLQTTLFL L> 

280 

* ****** 

AGTCTGCTCTTCCTGGTCCAAGGTGCCCACGGCAGGGGCCACAGGGAAGACTTTCGCTTCTGCAGCCAGC 
SLLFLVQGAHGRGHREDFRFCS Q> 

350 

* ****** 

GGAACCAGACACACAGGAGCAGCCTCCACTACAAACCCACACCAGACCTGCGCATCTCCATCGAGAACTC 
R N Q T H RS SLHYK PT P DLR I S I EN S> 

420 

* * * * * * * 

CGAAGAGGCCCTCACAGTCCATGCCCCTTTCCCTGCAGCCCACCCTGCTTCCCGATCCTTCCCTGACCCC 
EEALTVHAPFPAAH PASRS FPDP> 

490 

******* 

AGGGGCCTCTACCACTTCTGCCTCTACTGGAACCGACATGCTGGGAGATTACATCTTCTCTATGGCAAGC 
RGLYH FCLYWNRHAGRLHLLYGK> 

550 

* ****** 

GTGACTTCTTGCTGAGTGACAAAGCCTCTAGCCTCCTCTGCTTCCAGCACCAGGAGGAGAGCCTGGCTCA 
RDFLLSDKAS SLLCFQHQEESLA Q> 

630 

* ****** 

GGGCCCCCCGCTGTTAGCCACTTCTGTCACCTCCTGGTGGAGCCCTCAGAACATCAGCCTGCCCAGTGCC 
GPPLLATSVTSWWS PQNISLPSA> 

700 

******* 

GCCAGCTTCACCTTCTCCTTCCACAGTCCTCCCCACACGGCCGCTCACAATGCCTCGGTGGACATGTGCG 
AS FTF SFHS PPHTAAHNASVDM C> 

770 

* # * * * * * 

AGCTCAAAAGGGACCTCCAGCTGCTCAGCCAGTTCCTGAAGCATCCCCAGAAGGCCTCAAGGAGGCCCTC 
ELKRDLQLLSQFLKHPQKASRRPS> 

840 

******* 

GGCTGCCCCCGCCAGCCAGCAGTTGCAGAGCCTGGAGTCGAAACTGACCTCTGTGAGATTCATGGGGGAC 
AAPASQQLQSLESKLTSVRFMG D> 

910 
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ATGGTGTCCTTCGAGGAGGACCGGATCAACGCCACGGTATGGAAGCTCCAGCCCACAGCCGGCCTCCAGG 
MVSFEEDRINATVWKLQPTAG LQ> 

980 

***** * * 

ACCTGCACATCCACTCCCGGCAGGAGGAGGAGCAGAGCGAGATCATGGAGTACTCGGTGCTGCTGCCTCG 
DLHIHSRQEEEQSEIMEYSVLLPR> 

1050 

***** * * 

AACACTCTTCCAGAGGACGAAAGGCCGGAGCGGGGAGGCTGAGAAGAGACTCCTCCTGGTGGACTTCAGC 
TLFQRTKGRSGEAEKRLL LVDF S> 

1120 

***** * * 

AGCCAAGCCCTGTTCCAGGACAAGAATTCCAGCCAAGTCCT 
SQALFQDKNSSQVLGEKVLG IV V> 

1190 

***** * * 

AGAACACCAAAGTAGCCAACCTCACGGAGCCCGTGGTGCTCACTTTCCAGCACCAGCTACAGCCGAAGAA 
QNTKVANLTEPVVLTFQHQLQPK N> 

1260 

***** * « 

TGTGACTCTGCAATGTGTGTTCTGGGTTGAAGACCCCACATTGAGCAGCCCGGGGCATTGGAGCAGTGCT 
VTLQCVFWV EDPTLSSPGHWS5A> 

1330 

***** * * 

GGGTGTGAGACCGTCAGGAGAGAAACCCAAACATCCTGCTTCTGCAACCACTTGACCTACTTTGCAGTGC 

gcetvrretqtscfcnh.lt Y F A V> 

1400 

***** * * 

TGATGGTCTCCTCGGTGGAGGTGGACGCCGTGCACAAGCACTACCTGAGCCTCCTCTCCTACGTGGGCTG 
LM7S S V EVDAVH KHY L S LL S YVGC> 

1470 

* * * * * * * 

TGTCGTCTCTGCCCTGGCCTGCCTTGTCACCATTGCCGCCTACCTCTGCTCCAGGGTGCCCCTGCCGTGC 
VVSALACLVTIAAYLCSRVPLPC> 

1540 

***** * * 

AGGAGGAAACCTCGGGACTACACCATCAAGGTGCACATGAACCTGCTGCTGGCCGTCTTCCTGCTGGACA 
RRKPRDYTIKVHMNLLLAVFLL D> 

1610 

***** * * 

CGAGCTTCCTGCTCAGCGAGCCGGTGGCCCTGACAGGCTCTGAGGCTGGCTGCCGAGCCAGTGCCATCTT 
TS F L L S EPVALTGS EAGC R A SA-.I F> 

1680 

* * * * * * * 
CCTGCACTTCrCCCTGCrrCACCTGCCTT^CCT^ 

LHFS L LTCLSWMGLEGYN LY RL V> 

1750 

* * * « * t, * 
GTGGAGGTCTTTGGCACCTATGTCCCTGGCTACCTACrrCAAGCTGAGCGCCATGGGCTGGGGCTTCCCCA 

VEVFGTYVP GYLLKLSAMGWGFP> 
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1820 

******«, 

TCTTTCTGGTGACGCTGGTGGCCCTGGTGGATGTGGACAACTATGGCCCCATCATCTTGGCTGTGCATAG 
IFLVTLVALVDVDNYGPI I L A V H R> 

1890 

* * * * * * * ' 

GACTCCAGAGGGCGTCATCTACCCTTCCATGTGCTGGATCCGGGACTCCC1TGGTCAGCTACATCACCAAC 
TPEGVIYPSMCWIRDSLVSYITN> 

1960 

***** * * 

CTGGGCCTCTTCAGCCTGGTGTTTCTGTTCAACATGGCCATGCTAGCCACCATGGTGGTGCAGATCCTGC 
LGLFS LVFLFNMAMLATMVVQI L> 

2030 

******* 

GGCTGCGCCCCCACACCCAAAAGTGGTCACATGTGCTGACACTGCTGGGCCTCAGCCTGGTCCTTGGCCT 
RLRPHTQKWSHVLTLLGLSLVLG L> 

2100 

******* 

GCCCTGGGCCTTGATCTTCTTCTCCTTTGCTTCTGGCACCTTCCAGCTTGTCGTCCTCTACCTTTTCAGC 
PWALIFFSFASGTFQLVVLYLF S> 

2170 

***** * + 

ATCATCACCTCCTTCCAAGGCTTCCTCATCTTCATCTGGTACTGGTCCATGCGGCTGCAGGCCCGGGGTG 
I I TS FQGFL I F I WYWS MR L Q A . R G> 

2240 

****■» * + 

GCCCCTCCCCTCTGAAGAGCAACTCAGACTGCGCCAGGCTCCCCATCAGCTCGGGCAGCACCTCGTCCAG 
G?5PLK5N r SDCARLPISSG5TSS 3> 

2310 

***** * * 

CCGCATCTAGGCCTCCAGCCCACCTGCCCATGTGATGAAGCAGAGATGCGGCCTCGTCGCACACTGCCTG 
R I> 

2380 

* * * * * * * 

TGGCCCCCGAGCCAGGCCCAGCCCCAGGCCAGTCAGCCGCAGACTTTGGAAAGCCCAACGACCATGGAGA 

2450 

******* 

GATGGGCCGTTGCCATGGTGGACGGACTCCCGGGGCTGGGGCTTTTGAATTGGCCTTGGGGACTACTCGG 

2520 

* * * * * • * * 
CTCTCACTCAGCTCCCACGGGACTCAGAAGTGCGCCGCCATGCTGCCTAGGGTACTGTCCCCACATCTGT 

2590 

***** * * 

CCCAACCCAGCTGGAGGCCTGGTCTCTCCTTACAACCCCTGGGCCCAGCCTCATTGCTGGGGGCCAGGCC 

2660 

* * ft * * * * 
TTGGATCTTGAGGGTCTGGCACATCCTTAATCCTGTGCCCCTGCCTGGGACAGAAATGTGGCTCCAGTTG 
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CTCTGTCTCTCGTGGTCACCCTGAGGGCACTCTGCATCCTCTGTCATTTTAACCTCAGGTGGCACCCAGG 

2800 

***** * 

GCGAATGGGGCCCAGGGCAGACCTTCAGGGCCAGAGCCCTGGCGGAGGAGAGGCCCTTTGCCAGGAGCAC 

* * * 

AGCAGCAGCTCGCCTACCTCTGAGCCCGGAATTC 
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